Language Encodes Geographical Information
Abstract
Population counts and longitude and latitude coordinates were estimated for the 50 largest cities in the United States by computational linguistic techniques and by human participants. The mathematical technique Latent Semantic Analysis applied to newspaper texts produced similarity ratings between the 50 cities that allowed for a multidimensional scaling (MDS) of these cities. MDS coordinates correlated with the actual longitude and latitude of these cities, showing that cities that are located together share similar semantic contexts. This finding was replicated using a first-order co-occurrence algorithm. The computational estimates of geographical location as well as population were akin to human estimates. These findings show that language encodes geographical information that language users in turn may use in their understanding of language and the world.
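The step from pairwise similarity ratings to map-like coordinates can be illustrated with a minimal sketch. It uses classical (Torgerson) MDS as a stand-in, which is not necessarily the MDS variant used in the paper, and it replaces the LSA similarity ratings with an assumed similarity that falls off linearly with distance. The five toy locations and the helper names `classical_mds` and `pairwise_distances` are hypothetical and chosen only for illustration.

```python
import numpy as np

def classical_mds(dist, k=2):
    """Classical (Torgerson) MDS: embed points in k dimensions from a distance matrix."""
    n = dist.shape[0]
    centering = np.eye(n) - np.ones((n, n)) / n
    # Double-centre the squared distances to obtain a Gram (inner-product) matrix.
    gram = -0.5 * centering @ (dist ** 2) @ centering
    eigvals, eigvecs = np.linalg.eigh(gram)  # eigenvalues in ascending order
    top = np.argsort(eigvals)[::-1][:k]      # indices of the k largest eigenvalues
    # Clip tiny negative eigenvalues caused by floating-point error.
    return eigvecs[:, top] * np.sqrt(np.clip(eigvals[top], 0.0, None))

def pairwise_distances(points):
    """Euclidean distance between every pair of rows."""
    diff = points[:, None, :] - points[None, :, :]
    return np.linalg.norm(diff, axis=-1)

# Hypothetical toy "map": five made-up planar locations, not real city data.
true_xy = np.array([[0.0, 0.0], [4.0, 0.0], [4.0, 3.0], [0.0, 3.0], [1.0, 2.0]])
true_dist = pairwise_distances(true_xy)

# Assumed stand-in for text-derived similarity: linear fall-off with distance.
similarity = 1.0 - true_dist / true_dist.max()
dissimilarity = 1.0 - similarity

coords = classical_mds(dissimilarity, k=2)

# Compare inter-point distances in the MDS solution with the true distances.
upper = np.triu_indices(len(true_xy), k=1)
r = np.corrcoef(true_dist[upper], pairwise_distances(coords)[upper])[0, 1]
print(f"correlation between true and recovered distances: {r:.3f}")
```

An MDS solution is determined only up to rotation and reflection, so the sketch compares inter-point distances rather than raw axes. Correlating MDS dimensions with longitude and latitude, as the abstract describes, would additionally require aligning the solution with the map.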
Similar resources
Probabilistic Linkage of Persian Record with Missing Data
Extended Abstract. When comprehensive information about a topic is scattered across two or more data sets, using only one of them loses the information available in the others. Hence, it is necessary to integrate the scattered information into a single comprehensive data set. On the other hand, sometimes we are interested in recognizing duplicates within a data set. The i...
Geographical Estimates are Explained by Perceptual Simulation and Language Statistics
Several studies have demonstrated that language encodes geographical information. That is, the relative longitude and latitude of city locations can be extracted from language. Whether people actually rely on these linguistic features is less clear. Recent studies have suggested that language statistics plays a role in geographical estimates, but these studies rely on map drawings, a fundamenta...
Controlled Language for Geographical Information System Queries
Natural language interfaces to spatial databases have not received a lot of attention in computational linguistics, in spite of the potential value of such systems for users of Geographical Information Systems (GISs). This paper presents a controlled language for GIS queries, solves some of the semantic problems for spatial inference in this language, and introduces a system that implements this...
Manipulations of Graphs with a Visual Query Language: Application to a Geographical Information System
Operators for geographical databases can be classified into two categories: thematic-oriented operators and network-oriented operators. Thematic-oriented operators are based on geometric representations (e.g., intersection, inclusion, adjacency). Network-oriented operators are based on graph manipulations whatever the geometric representation is (e.g., a transitive closure of a graph). In this ...
Indexing implicit locations for geographical information retrieval
Local search has recently become a hot topic in the information retrieval research area. How to retrieve geographical information correctly and efficiently is a key challenge for location-based search services. In this paper, we present a GIR (geographical information retrieval) system which uses implicit locations to improve retrieval performance. Experimental results based on Geo-CLEF 2006 (a cros...
Journal title:
- Cognitive science
Volume 33, Issue 1
Pages -
Publication date: 2009